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ABSTRACT 



Lower bounds are established on the computational complexity of the 
decision problem and on the inherent lengths of proofs for two 
classical decidable theories of logic: the first order theory of 
the real numbers under addition, and Presburger arithmetic ~ the 
first order theory of addition on the natural numbers. There is 
a fixed constant c > such that for every (non-deterministic) 
decision procedure for determining the truth of sentences of real 
addition and for all sufficiently large n, there is a sentence^f 
length n for which the decision procedure runs for more than 2 
steps. In the case of Presburger arithmetic, the corresponding 

en 
bound is 2 2 . These bounds apply also to the minimal lengths of 
proofs for any complete axiomatization in which the axioms are 
easily recognized. 
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1. Introduction and Main Theorems . 

We present some results obtained in the Fall of 1972 on the computational 
complexity of the decision problem for certain theories of addition. In 
particular we prove the following results. 

Let L be the set of formulas of the first-order functional (predicate) 
calculus written using just + and =. Thus, for example, ~[x + y = y + z] 
vx + x = x is a formula of L, and Vxi?y[x + y = y] is a sentence of L. 
Even though this is not essential, we shall sometimes permit the use of the 
individual constants and 1 in writing formulas of L. We assume a finite 
alphabet for expressing formulas of L, so a variable in general is not a 
single atomic symbol but is encoded by a sequence of basic symbols. 

Let ^7 = <N, +> be the structure consisting of the set N= {0, 1, 2, ...} 
of natural numbers with the operation + of addition. Let Th(f7 ) be the 
first-order theory of 47, i.e. the set of all sentences of L which are 
true in 47. For example, Vx Vy[x + y = y + x] is in Th(*7). Presburger 
has shown that Th(f7 ) is decidable [2]. For brevity's sake, we shall call 
Th(41) Presburger arithmetic and denote it by PA. 

Theorem 1 . There exists a constant c > such that for every decision 

procedure (algorithm) AL for PA, there exists an integer n Q so that for every 

n > n„ there exists a sentence F of L of length n for which AL requires more 

~cn 
than 2 computational steps to decide whether F e PA. 

The previous theorem applies also in the case of non-deterministic 
algorithms. This implies that not only algorithms require a super-exponential 
number of computational steps, but also proofs of true statements concerning 



addition of natural numbers are super-exponentially long. Let AX be a system 
of axioms in the language L (or in an extension of L) such that a sentence 
F e L is provable from AX (AX \- F) if and only if F e PA. Let AX satisfy 
the condition that to decide for a sentence F whether F e AX, i.e. whether 
F is an axiom, requires a number of computational steps which is polynomial 
in the length |F| of F. 

Theorem 2 . There exists a constant c > so that for every axiomatization 

AX of Presburger arithmetic with the above properties there exists an integer 

n so that for every n > n there exists a sentence F e PA such that the 

2 en 
shortest proof of F from the axioms AX is longer than 2 . By the length of 

a proof we mean the number of its symbols. 

With slight modifications, Theorem 2 holds for any (consistent) system 
AX of axioms in a language M in which the notion of integer and the operation 
+ on integers are definable by appropriate formulas so that under this 
interpretation, all the sentences of PA are provable from AX. The ordinary 
axioms ZF for set theory have this property. 

The result concerning super-exponential length of proof applies, in this 
more general case, to the sentences of M which are encodings of sentences of 
PA under the interpretation, i.e. to sentences which express elementary 
properties of addition of natural numbers. 

The previous results necessarily involve a cut-point n^(AL) or Uq(AX) 
at which the super-exponential length of computation or proofs sets in. It 
is significant that a close examination of our proofs reveals that n Q (AL) 
= 0(|AL|) and n (AX) = 0(|AX|). Thus computations and proofs become very 
long quite early in the game. 



The theory PA of addition of natural numbers is one of the simplest most 
basic imaginable mathematical theories. Unlike the theory of addition and 
multiplication of natural numbers, PA is decidable. Yet any decision procedure 
for PA is inherently difficult. 

Let us now consider the structure A = <R, +> of all real numbers R with 
addition. The theory Th(#) (in the same language L) is also decidable. In 
fact, to find a decision procedure for Th(jft) is even simpler than a procedure 
for PA; this is mainly because K is a divisible group without torsion. Yet 
the following holds. 

Theorem 3 . There exists a constant d > so that for the theory Th(j&) 
of addition of real numbers, the statement of Theorem 1 holds with the lower 
bound 2 

Similarly for the length of proofs of sentences in Th(/£). 

Theorem 4 . There exists a constant d > so that for every axiomatiza- 
tion AX for Th(^?) the statement of Theorem 2 holds with the lower bound 2 . 

Corollary 5 . The theory of addition and multiplication of reals (Tarski's 
Algebra [3]) is exponentially complex in the sense of Theorems 3 and 4. 
(This result was obtained independently by V. Strassen.) 

Ferrante and Rackoff [1] describe decision procedures for Th($.) and PA 

dn 
en JL 
which run in deterministic space 2 and 2 (and hence in deterministic 

dn 

2 cn 2 
time 0(2 ) and 0(2 )), respectively, for certain constants c and d. 

Any substantial improvement in our lower bounds would settle some general 

open automata- theoretic questions on the relation between time and space. 

n 2 
For example, a lower bound of time 2 for the decision problem for Th(ft) 



en 
would give an example of a problem doable in space S(n) = 2 but not in time 

bounded by a polynomial in S . 

Variations of the methods employed in the proofs of Theorems 1-4 lead 
to complexity results for the (decidable) theories of multiplication of 
natural numbers, finite Abelian groups, and other classes of Abelian groups. 
Some of these results are stated in Section 7 and will be presented in full 
in a subsequent paper. 

The fact that decision and proof procedures for such simple theories are 
exponentially complex is of significance to the program of theorem proving by 
machine on the one hand, and to the more general issue of what is knowable in 
mathematics on the other hand. 

2. Algorithms . 

Since we intend to prove results concerning the complexity of algorithms, 
we must say what notion of algorithm we use. Actually our methods of proof 
and our results are strong enough to apply to any reasonable class of algo- 
rithms or computing machines. However, for the sake of def initeness, we 
shall assume throughout this paper that our algorithms are the programs for 
Turing machines on the alphabet {0, 1}. 

We proceed to give an informal description of these algorithms. The 
machine- tape is assumed to be one-way infinite extending to the right from 
an initial left-most square. At any given time during the progress of a 
computation, all but a finite number of the tape's squares contain 0. An 
instruction has the form: "i: If then print X Q , move M Q , go to one of 
i , i , ... ; if 1 then print X , move M , go to one of j 1 , j^, ... •" Here 
i, i , i , ... , j , j„, ... are natural numbers, the so-called instruction 



numbers; X and X are either or 1; and M Q and M^ are either R or L (for 

"move right" and "move left", respectively). 

The possibility of going to one of several alternative instructions 

embodies the non-deterministic character of our algorithms. Another type 

of instruction is: "i: Stop." Instructions are abbreviated by dropping 

the verbal parts. Thus, "3: 0, 1, L, 72, 5; 1, 1, R, 15, 3." is an example 

of an instruction. A program AL is a sequence 1^, ..., I n of instructions. 

For definiteness' sake we assume that the instruction number of I ± is i and 

that I is the instruction "n: Stop." Furthermore AL is assumed to be coded 
n 

in the binary alphabet {0, 1} in such a way that "Stop" also serves as an 
end-word indicating the end of the binary word AL. 

Let x e {0, 1} be an input word. To describe the possible computations 
by the algorithm AL on x, we assume that x is placed in the leftmost positions 
of the machine's tape and the scanning head is positioned in the leftmost 
square of the tape. The computation starts with the first instruction 1^. 

A halting computation on x is a sequence C = (I. , ... , 1^ ) of instructions 

1 m 

of AL so that i, = 1 and i = n. At each step 1 < p < m, the motion of the 

1 m 

scanning head, the printing on the scanned square, and the transfer to the 

next instruction I. , are according to the current instruction I ± . The 

X p+1 P 

length £(C) of C is, by definition, m. 

It is clear that a truly non-deterministic program may have several 
possible computations on a given input x. 



3. Method for Complexity Proofs . 

Having settled on a definite notion of algorithm, we shall describe a 

general method for establishing lower bounds for theories of addition which 

are formalized in L. We do not develop our methods of proof in their fullest 

generality but rather utilize the fact that we deal with natural or real 

numbers to present the proofs in a more readily understandable and concrete 

form. The refinements and generalizations which are needed for other 

theories of addition will be introduced in a subsequent paper. 

2 n 
Theorem 6 . Let f(n) be one of the two functions 2 or 2 Assume for 

a complete theory T that there exists a polynomial p(n) and a constant d > 

so that for every program AL and binary word x, there exists a 

sentence F with the following properties. 

AJ_l y X 

(a) F e T if and only if some halting computation C of AL on x 

ALi ^ X 

satisfies £(C) < f (|x|). 

(b) |F &T | < d-(|AL| + |x|). 

(c) ~F is Turing machine calculable from AL and x in time less than 

AL,x 

P ( I AL | + |x|). 
(We recall that all our objects such as F, AL, etc., are binary words, and 
that jw | denotes the length of w.) 

Under these conditions, there exists a constant c > so that for every 
decision algorithm AL for T there exists a number n Q = n Q (AL) so that for 
every n > n n there exists a sentence a e T such that \o\ = n and every compu- 
tation by AL for deciding O takes more than f(cn) steps. Furthermore 
n Q (AL) = 0(|AL|). 



Proof . There exists a number c > and an m Q so that for m > m Q we 
have 

p(2m) + f(c-(2dm + 1)) < f (m) . (D 

Namely, let c < l/(2d) and recall that p(n) is a polynomial, whereas f(n) 

9 n 
n „z 
is 2 or 2 

Let AL be a (non-deterministic) decision algorithm for T. We construct 
a new algorithm AL as follows. We do not care how AL Q behaves on an input 
word x which is not a program. If x is a program, then AL Q starts by 
constructing the sentence F = ~F . The program AL Q then switches to AL 
which works on the input F. If AL stops on F and determines that F e T, 
then AL halts; in all other cases AL- does not halt. Thus, for a program 
x as input, AL halts if and only if the program x does not halt on the input 
x in fewer than f(|x|) steps. Note that by possibly padding AL Q with irrele- 
vant instructions, we may assume that m_ < I^LqI < I AL. | + k, where k is 
independent of AL. 

Denote the binary word AL„ by z and let O be the sentence ~ F Z>Z - F z>z 
cannot be true, for if it were true, then ~F would be false and AL Q would 
not halt on z, whereas the truth of F implies that z (= AL Q ) does halt on 
the input z (even in at most f(|z|) steps), a contradiction. 

Thus, a is true and hence AL_ (= z) halts on z. The truth of a also 
implies that every halting computation of AL on z is longer than f(|z|). 

Let m = Izl. By (b) , we have 

n = |ct I < 2dm + 1. (2) 

Let t be the least number of steps that AL takes, by some halting computation, 



to decide O. By the definition of AL and the fact that fewer than p(2m) 
steps are required to find C = ~F from z (this follows from (c) and 

Z y Z 

|z| = m) , there is a halting computation of the program AL.. on z 
requiring fewer than p(2m) + t steps. By the truth of O, 

p(2m) + t > f (m) . 
Using (1) and (2), 

t > f(c-(2dm + 1)) > f(cn). 
Take n Q to be n = \a\. Then n_ < 2dm + 1 < 2d ( I AL I + k) + 1, so 
n n = ( | AL | ) . The fact that the result holds for AL and every n > n Q 
(with possibly a smaller constant c) is obtained by first padding AL. by 
irrelevant instructions, and then padding the resulting a by prefixing a 
quantifier 3x. of an appropriate length, where 13 x. I =1+ I j I . The 
details are left to the reader. □ 

For utilizing Theorem 6 we need a method for constructing sentences 

F. T with the properties (a) - (c) . One such method is provided by: 
AL, w 

Theorem 7 . Let <fi- = <A, +> be an additive structure such that N £ A, 

and on N the operation + is ordinary addition. Let f(n) again be one of the 

2 n 
functions 2 n or 2 . Assume that T = Th(<fl) is a theory of addition 

(formalized in the language L) for which there exists c > such that for 

every n and for every binary word w, |w| = n, there exist formulas I (y) , 

J (y) , S (x, y) and H (x) with the following properties, 
n n w 

(i) |S (x, y)| < en, |I (y) | < en, |J (y) | < en, and |H (x) I < en. 
n n n w 

o 

(ii) I (b) is true in <ft- for b e A if and only if b e N and b < f(n) . 
n 

J (b) is true exactly for b = f(n). 
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2 
(iii) S codes all binary sequences of length f (n) . Namely, for every 
n 

binary sequence 3 e (0, l}*, 131 = f(n) , there exists an a e A 
so that for i e N, < i < f (n) , S (a, i) is true in oft 
if g(i) = 1, and S (a, i) is false if 3(i) = 0, where for any 
sequence 3, 3(i) denotes the i+1 element of 3, < i < 131- 

(iv) H (x) is true for a e A if and only if the first f(n) symbols of 
w 

the sequence coded by a in the sense of (iii) has the form 
wO , p = f(n) - |w|. 

(v) S (x, y) , I (y) , J (y) and H (x) are Turing machine calculable 
n n n w 

from n and w in a polynomial number of steps. 

From such formulas S , I , J and H , a formula F with the properties 

n n n w aij , w 

(a) - (c) can be constructed, so that T satisfies the conclusion of Theorem 6. 

2 
Proof . We shall describe, by use of sequences of length f(n) , all 

possible halting computations of length at most f(n) of a program AL on an 

input w. Let C = (I. , ... , I. ) be such a computation. Assume that AL has 

1 1 X m 
k instructions; by our notational conventions every computation starts with 



the first instruction I., and the last instruction I, of AL is: "k: Stop. 

Thus in C, i n =1 and i = k. 
1 m 

Let us adopt the convention that after the stop instruction, the scanning 
head, the (stop) instruction, and the tape contents stay stationary and 
unchanged at all subsequent time instants. Since m < f(n), the scanning head 
never moves beyond f (n) squares from the initial left-most square of the tape. 
We assume also that the Turing machine never attempts to shift its head left 
off the beginning of the tape. 
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The progress of the computation C on the input w will be described by- 
stringing together f(n) instantaneous descriptions of the computation in 
the following manner. Let W. be the first (left-most) f(n) symbols of the 
tape at time j, 1 < j < f(n). Then the string W i W 2'"' W f( n ) = w £ i- > 1 * 

codes all the relevant information concerning the tape contents during the 

2 
computation C. We have |W| = f(n) . Also, W^ = W^^ - ... . 



To trace the motion of the scanning head and the sequence of instructions 

* Pj . n q j 
during the computation C, we define U. e {0, 1, ... , k} to be i. 

J -J 

where p + q. + 1 = f(n) and p. is the distance at time j of the scanning 
3 1 1 

head from the start square, 1 < j < f(n). Recall that i is the instruction 

number of the j instruction executed in C. Also i = ± m+ ^ = ••• = k, the 

2 
stop instruction. Put U = U i U 2 - ' - U f ( n ) ' We have ' U ' = f ^ " 

The fact that the pair (W, U) , where W e {0, 1} , U e {0, 1, . . . , k} , 
| w | = |U| = f(n) 2 , describes a halting computation of AL on w, is equivalent 
to a number of statements which say, roughly, that the first f(n) symbols are 
the initial configuration; that the transformation from a block of f(n) 
symbols to the next block is by an instruction of AL; and that U contains k 
(the number of the halting instruction). More precisely, (W, U) codes a 
halting computation of length at most f(n) of AL on w, where |w| = n, if and 
only if : 

(a) W(0)...W(f(n)-l) = w0 P , p = f(n) - |w|. 

(B) U(0)...U(f(n)-l) = 10 f(n)-1 . 

(Y) If U(i) = and i + f(n) < f(n) 2 , then W(i + f(n)) = W(i). 
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(6) If U(i) = q, i + f(n) + 1 < f(n) 2 , < q < k, W(i) = 0, and I is, 

say, "q: 0, 1, R, k^ ... , k ; 1, ...", then W(i + f(n)) = 1, 

U(i + f(n) + 1) = k ± or U(i + f(n) + 1) = k 2 or etc. Similarly 
for other instruction and tape-symbol combinations. 

(e) If, for f(n) < i < f(n) 2 , U(i) ^ 0, then exactly one of U(i - f(n)), 
+ 0, or U(i - f(n) - 1) ± 0, or U(i - f(n) + 1) t holds. Also, 
if U(i) i 0, then U(i ± 1) = U(i ± 2) = 0. 

(?) U(i) = k for some i, < i < f(n) 2 . If U(i) = k and i + f(n) 

< f(n) 2 , then U(i + f(n)) » k and W(i + f(n)) = W(i). 

From the assumption that (W, U) satisfies (a) - (?) , it can be proved 

by induction on 1 < j < f(n) that (W. +1 , U ,) is an instantaneous description 

which follows from (W., U.) by an application of the instruction I whose 

J J j 

number appears in U.. Also, (W . . , ,U r/ . ) is a halting instantaneous description. 

j r(n) 1 (n; 

* * 

Thus, the existence of a pair (W, U), W e {0, 1} , U e {0, 1, ... , k} , 

2 
| W 1 = |U| = f(n) , which satisfies (a) - (?) is a necessary and sufficient 

condition for the existence of a halting computation C on w with Jl(C) < f(n). 

Conditions (i) - (v) provide means for making statements about arbitrary 

2 2 

(0, 1) sequences of length f(n) , about integers < i < f (n) , and about the 

integer f (n) , all by use of formulas of L of size 0(n). Also, the ordinary 

2 
ordering < on N restricted to integers of size less than f(n) can be expressed 

by the length 0(n) formula 

x < y *-+ 3z[I (x) a I (y) a I (z) a x + z = y]. 
n n n n 
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Hence, the existence of (W, U) satisfying (a) - (£) can be expressed by a 

sentence F = F with the desired properties (a) - (c) . Namely, express 
AL,w 

0, 1, ... , k in binary notation by words of equal length p = |k|. Then, 

via S (x, y) , a single element a e A exists which codes W, and elements 

a a e A code U. The sentence F will start with quantifiers and 

1 P 

relativization : 

f = 3x3x 1 ... 3x Vy Vz [i n (y) A J n ( z ) 

+E a A Eg A E y A E fi A E £ A E c ]. 

x codes the sequence W and x. , . . . ,x together code the sequence U. The 

1 p 

clauses E ... E express the corresponding conditions (a) - (£)• Thus, 
a £ 

for example, E is H (x) ; E fl is H (x.. ) a H (x ) a ... a H (x ) , where 
r a w p u i 2 p 

u = 10 n_1 and u. = n , 2 < j < p; and E is 

[ ~ S n (x l' y) A •■* A ~ S n (x p' y) A X n (y + z) 

-> [S (x, y + z) *-+ S (x, y)]]. 
n n 

The reader can supply the details of the construction of the remaining 

expressions Eg, E £ and E and verify that, altogether, the F^^ thus formed 

satisfies (a) - (c) of Theorem 6. D 



4. Proof of Theorem 3 (Real Addition). 

We start by showing that for the theory Th(^) of real addition, there 

exist formulas S (x, y) , I (y) , etc. as postulated in Theorem 7 with f(n) = 2 , 
n n 

thereby proving Theorem 3. Several of the results in this section will play 



14 



later on a role in the proof for PA. 

Let F(x, y) be any formula and consider the conjunction 
G = F(x , y ) a F(x 2 , y 2 ) a F(x 3> y 3 >. 

It is readily seen that G -*->• G where 

G = Vx Vy [((x - x a y = y^ v (x » x 2 a y = y 2 ) 

v (x = x_ a y = y 3 >) -»■ F(x, y)]. 

Note that |G| = 3-|F(x, y) | , whereas | G^ | = |F(x, y) I + c, where c is inde- 
pendent of F(x, y). A similar rewriting exists for formulas F with more 
than two variables and for conjunctions of more than three instances of F. 
The above device is a special case of a more general theorem due to M. Fischer 
and A. Meyer. It was discovered independently by several people including 
V. Strassen. 

Theorem 8 . There exists a constant c > so that for every n there is 

a formula M (x, y, z) of L such that for real numbers A, B, C, 

2 n 

M (A, B, C) is true +->-AeNAA<2 aAB = C. 
n 

Also IM (x, v, z)| < c(n+l) and M (x, y, z) is Turing machine computable from n 
in time polynomial in n. 

Proof. The construction of M (x, y, z) will be inductive on n. For 
n 

2° 
n = we have 2 =2 and we define M Q (x, y, z) as 



[x=0a z =0]v[x=1az= y]. 
From M, we get M, - by observing that x e N and x < 2' if and only if 



k+1 
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2 k 
there exist x.. , x_, x , x, e N all less than 2 so that x = x x + x„ + x, . 



For this decomposition we have 

z = xy = x^x^) + x 3 y + x^y. 
Hence, M, . (x, y, z) is equivalent to 

3u 1 u 2 ...u 5 x 1 ...x 4 ( \( x i' x 2' u l^ A \^*2' y ' U 2^ A *V X 1' U 2' U 3^ 

A ^^3' y ' *V A \(*£ i ' y ' u 5^ A x = u l + x 3 + x 4 
a z = u_ + u, + u, ] . 

(Strictly speaking, a triple sum such as u.. + x + x, should be written as a 
chain of sums of two variables, but we shall not do it here.) Now, |M, .. | 
> 5 | M, |, which will not do. However, by using the device preceding the 
theorem, the five occurrences of M, can be replaced by a single occurrence 
to yield M, . Thus, |M, . (x, y, z) | < |M, (x, y, z) | + c for an appropriate 
c > 0. Hence, |M (x, y, z) | < c(n+l). (We assume c is chosen large enough 
so c > |M (x, y, z) | .) 

Actually, for the above bound to hold, it is necessary to show that 

the number of distinct variable names in M does not grow with n, for to 

n 

encode one of v variables requires (on the average) a string of length 
0(log v) . In fact, 15 different variable names are sufficient to express 
M . This is because the new variables introduced in constructing M, - 
from M, need only be distinct from each other and from the free variables 
of M, ; however no difficulty arises if they coincide with variables bound 
inside M, . A closer look at the construction of M.., shows that 12 new 
variables are introduced, which must be distinct from the three free variables 
of M, , giving a total of 15 distinct names needed. D 
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Corollary 9 . The formula M (x, 0, 0) is true for a real number x if 
and only if x e N and x < 2 



.2 n 



j 11 n 

The natural numbers x < 2 code all binary sequences of length 2 . 

Namely, write x in binary notation 

x = x(0) + x(l)-2 + ... + x(2 n -l)-2 " . 
We use the function 2 1 to obtain element x(i) of x. 

Theorem 10 . There exists a formula Pow n (x, y, z) such that for integers 

-2 n 



a, b, c for which < a, b a , c < 2 , Vovr^a, b, c) is true if and only if 
b a = c. Also, |Pow (x, y, z) | < d(n+l) for an appropriate d > and all n. 

Proof . Construct, by induction on k, a sequence E k (x, y, z, u, v, w) 

of formulas with the property that for integers a, b, c for which 

k ~n 

< a < 2 , < b a , c < 2 and real numbers A, B, C, E k (a, b, c, A, B, C) 

2 n a 
is true in <R, +> if and only if A e N, A < 2 , b = c, and AB = C. Thus, 

E, has M built into it since 
k n 

E k (0, 1, 1, A, B, C) +-* M n (A, B, C) . 

The case k = is given by 

t(x = a z = 1) v (x = lAz=y)]A M n (u, v, w). 



To 



obtain E, ^(x, y, z, u, v, w) from E , we again use the decomposition 



,k+i 



2 
x=xx +x + x, of every integer < x < 2 in terms of integers 

2 k 
< x , x , x 3 , x 4 < 2 . Then we have 
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X . X1-X2 X3 XI* 

y = (y ) -y -y • 

X 1 X 9 

Now, y Xl is expressed by a z such that E^x^ y, z ± , 0, 0, 0); then (y ) 
is a z 2 such that \(* 2 , * v ^ r 0, 0. °) etc - Whenever we have to write a 

product such as X] x or (y Xl ) X2 -y X3 , we use the formula E k (0, 1, 1, u, v, w) . 

In this way we can write the formula E k+1 (x, y, z, u, v, w) . Using the usual 

device of contracting a conjunction of instances of E fc into one occurrence, 

we see that I EL , J < |E. I + d for some d > 0, and hence |E I < d(n+l) + c(n+l), 
k+1 k " 

where c(n+l) is the bound on the length of M . As before, only a bounded 
number of variable names are needed. 

Recalling the definition of E k (x, y, z, u, v, w) , we see that 

Pow (x, y, z) «-»- E (x, y, z, 0, 0, 0) 

n n 

has the desired properties. □ 

Theorem 11 . There exists a formula S n (x, y) of L which for x, y e R is 

2 2n 2n 
true in <R, +> if and only if x and y are integers, x < 2 and y < 2 , and 

the y+l St digit x(y) of x, counting from the low-order end of the binary 
representation of x, is 1. The formula S n (x, y) satisfies the conditions of 
Theorem 7 for f(n) =2 . 

Proof . That x and y are integers in the appropriate ranges is easily 
expressible by formulas of size 0(n). Recall that for the integers which 

2 2n 
satisfy M„ (x, 0, 0) , i. e. < x < 2 , the ordering < is expressible by a 
2n 

formula of length 0(n). 
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y y+i 

Now x(y) = 1 if and only if there exists an integer z, 2 < z < 2 
so that x > z and 2 y+1 divides x - z. This fact is easily expressible by 
a formula S (x, y) of L using Pow and M, 



2n 2n* 

That formulas I (y) and J (y) with the properties listed in Theorem 7 
exist is immediate. Thus to finish the proof of Theorem 3 we need the 
following. 

Theorem 12 . For every binary word w, |w | = n, there exists a formula 

2 2n 

H (x) of L which is true in <R, +> for an integer < x < 2 if and only 
w 

if x(0)...x(2 n -l) = wO P , p = 2 n - n. The formula H w (x) satisfies the condi- 
tions of Theorem 7. 

Proof . Define for binary words u, by induction on |u|, formulas K u (z) 
as follows. 

K Q (z) +-* z = 0, 

Kj_(z) +-*■ z - 1, 

K n (z) +-*3y[ K (y) a z = y + y ], 
u0 u 

K ul (z) «->3y[ K u (y) Az = y + y + l ]. 

Clearly, if K^(z) is true, then, considered as a sequence, z satisfies 

w(i) = z(i) for < i < |w|, z(i) = for i > | w | . Using this K w (z) and tht 

formulas S (x, y) and J (y) , we can write the formula H (x) by 

formally expressing the statement that for z such that ^(z) , x(i) = 

z(i), < i < 2 n . D 
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Thus we have proved, for Th(/£), the existence of formulas S n (x, y) , 

I (y). J (y) » and H (x) which satisfy the conditions of Theorem 7 for 
n n w 

f(n) = 2 n . This completes the proof of Theorem 3. 



5. Proof of Theorem 4 (Lengths of Proofs for Real Addition). 

We now show that for Th(/t) proofs are also exponentially long. This 
is an easy consequence of Theorem 3. 

Let AX be a consistent system of axioms which is complete for Th(X)> 
i.e. every sentence F e Th($) is provable from AX (AX (- F) . Furthermore, 
there exists an algorithm B which decides in polynomial time p(|G|) for a sen- 
tence G of L whether G e AX. 

Let c be the constant of Theorem 3. For every polynomial q(x), there 

/n dn N „cn 
exists a constant < d so that from a certain point on, q(Z ; < 2. 

Construct a non-deterministic algorithm AL for Th(R) as follows. 

Given a sentence F, AL writes down (non-deterministically) a binary sequence 

P. Then AL checks whether P is a proof of F from AX or a proof of ~F from 

AX. The computation halts only if one of the two possibilities occurs. 

Because of the assumptions on AX, this check can be made in a polynomial 

number of steps h(|P|). Thus the whole computation, if it halts, requires 

|P | + h(|P|) = q(|P|) steps. If every true sentence F would have a proof P 

with |P| < 2 n where n = |F|, then for every such F there would be some halting 

computation of length less than q(2 ), i.e. also less than 2 for all 

sufficiently large n, a contradiction. 
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6. Proof of Theorems 1-2 (Presburger Arithmetic). 

The proof for Theorem 1 follows closely along the lines of the proof 

of Theorem 3 and utilizes our previous results. In particular we note that 

Theorems 8-10 apply, as they stand and with the same proofs, to PA. Note 

also that the order < on N is definable in PA using +. Throughout this 

2 n 
section, let f(n) be 2 

2 2 n+1 
Theorem 12 . There exists a function g(n) > 2 W - 2 so that for 

every n there exists a formula Prod (x, y, z) with the following properties. 

For integers A, B, C, 

Prod (A, B, C) is true in ?7 «-* A, B, C < g(n) and AB = C. 

There exists a constant c > so that |Prod | < c(n+l) for all n. The formula 

Prod is Turing machine constructible from n in time polynomial in n. 
n 

Proof . We shall use the Prime Number Theorem which says that the number 

of primes smaller than m is asymptotically equal to m/log m; hence bigger than 

2 n+2 
m/log m for all sufficiently large m. Thus, for m = 2 , the number of 

„n+2 „ 2 n+l „ 
primes p < m exceeds 2 /2 n > 2 = f(n) . Let g(n) = II p, where p 

p<m 

2 n+2 f(n) 2 

runs over primes, m = 2 ; then g(n) > 2 since 2 < p for all primes. 

By use of the formula M +? (x, y, z), we can write two formulas 

Res ,„(x, y, z) and P ,_ (x) of length 0(n) with the following meanings. Let 
n+2 n+Z 

res(x, y) denote the residue (remainder) of x when divided by y. Then 

2 n+2 
Res +2 (x, y, z) «->- [ y < 2 a res(x, y) = z ]; 
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2 n+2 
P + (x) +-y [ x < 2 and x is prime ]. 

The formula Res , „ is written in L as 
n+2 

z < y A 3q 3w[ M (y, q, w) A x = w + z ] . 

2 n+2 
We recall that for any q and w, M (y, q, w) holds if and only if y < 2 

and yq = w. 

The formula P (x) is, simply, 

M n+2 (x ' °* 0) A V y^ z t M n+ 2 (y ' z»x)^[y = lv y = x ]]. 

By formally saying that x > 1 is the smallest integer divisible by all 

2 n+2 
primes p < 2 , we can write a formula G (x) which is true precisely for 



x = g(n). Now Prod (x, y, z) is true if and only if 

2 n+2 
x, y, z < g(n) a Vu[ u < 2 -* res(x,u) -res(y,u) = res(z,u) ]. (3) 



2 n+2 
Namely, this implies that xy = z (mod p) for all p < 2 , which together 

with x, y, z < g(n) is equivalent to xy = z. Now, by use of G , „ (x) , 

n+2 

M +2 ( x » y> z ) and Res +2 (x> Y> z ) > tne above relation (3) can be expressed 

by a formula Prod with the desired properties. D 

Exponentiation can be defined just as in the proof of Theorem 10 except 
that we now use Prod (x, y, z) instead of M (x, y, z) to obtain a sequence of 
formulas E'(x, y, z, u, v, w) . For integers a, b, c, A, B, C for which 

2 k 
< a < 2 and < b , c < g(n), E'(a, b, c, A, B, C) is true in t| if and 

only if A, B, C < g(n), b a = c, and AB = C. Also |E'| = 0(n). 

n 
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Having now multiplication up to g(n) and exponentiation 2 up to 

? n+l 
i < 2 expressed by formulas of length 0(n), we can code sequences ot 

? n+l „ , 

length 2 = f(n) in exactly the same manner as in Section 4. This com- 
pletes the proof of Theorem 1 by again appealing to Theorem 7. 

The proof of Theorem 2 now follows exactly the lines of the proof of 
Theorem 4 given in Section 5. 



7. Other Results . 

The techniques presented in this paper for -proving lower bounds on logical 
theories may be extended in a number of directions to yield several other 
results. We outline some of them below without proof; they will be presented 
in full in a subsequent paper. 

Theorem 13 . Let 01 be any class of additive structures, so if 
<R. = <A, +> e Ot, then + is a binary associative operation on A. Let Th(fft) 
be the set of sentences of L valid in every structure of Ot. Assume Ot has 
the property that for every k e N, there is a structure u# k = <A^, +> e (% 

and an element u e A such that the elements u, u+u, u+u+u, ... , k>u are 
distinct. Then the statement of Theorem 1 holds for Th«7D with the lower 
bound 2 for some d > 0. 

Theorem 3 is an immediate corollary of this result, taking Ot to be the 
class of just the one structure &= <R, +> . Some other classes to which the 
result applies are: 
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(1) The complex numbers under addition. 

(2) Finite cyclic groups. 

(3) Rings of characteristic p. 

(4) Finite Abelian groups. 

(5) The natural numbers under multiplication. 

The proof of Theorem 13 extends the ideas of Section 4. The element n-u 
is used as the representation of the integer n, and u itself is selected by 
existential quantification. 

Special properties of certain theories permit us to obtain still larger 

lower bounds on the decision problem. For example, we get a lower bound of 

2 en 
2 for (4), the theory of finite Abelian groups. This is obtained by encoding 

2 n 
integers up to 2 by formulas of length 0(n) just as in Theorem 13, but 

instead of representing a sequence by an integer, we let the structure 

itself encode the sequence. Let h be a finite Abelian group. Then element 

s(i) of the sequence s encoded by <7 is 1 if and only if Jo contains an element 

st 
x of order p., where p. is the i+1 prime. The necessity of using primes 

as indices instead of integers considerably complicates the analog of Theorem 7. 

Another example where we get still larger bounds is (5) , the theory of 

multiplication of the natural numbers (MULT) . That MULT is at least as hard 

as PA is immediate, for the powers of 2 under multiplication are isomorphic 

to f7 , and the property of being a power of 2 can be expressed in MULT (assuming 

we have the constant 2; otherwise we use an arbitrary prime). In fact, the 

~cn 

2 
bound can be increased yet another exponential to 2 by using the encoding 
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which associates a sequence s to a positive integer n, where s(i) = 1 if and 
only if q ± divides n, where q. is the i+l St prime in some fixed (but arbitrary) 
ordering of the primes. Again we are forced to use the primes as indices, 
and again the analog to Theorem 7 is considerably complicated. 
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